Reconstructing fragmented gel files created by Model 377 DNA Sequencers.
نویسندگان
چکیده
To our knowledge, the most widely used automated DNA sequencers are manufactured by PE Applied Biosystems (Foster City, CA, USA). Our laboratory uses Model 377 Sequencers equipped with the XL upgrade (i.e., collection software Version 2.0 and analysis software Version 3.0). Power transients, network noise, an excessively fragmented hard disk or other such incidents can cause a sequencer to prematurely stop data collection. If such an incident occurs, the most common practice used to salvage as much of the data as possible is simply to restart the data collection software. As a result of this maneuver, two (or more) gel files are created, each containing fragments of the data from the same DNA samples. In our laboratory, this situation occurred in an experiment in which a sample of pGEM labeled with Energy Transfer Dye Primers (Amersham Pharmacia Biotech, Piscataway, NY, USA) was run. The first of the gel file fragments created was 1624-scans-long, while the second was 6792-scans-long. Figure 1 shows the two electropherograms produced. Note that the first trace fragment (Figure 1A) is completely unusable, while the second (Figure 1B) has five base calls that deviate from the known sequence of pGEM (3). The errors are underlined in the figure. Note that PE Applied Biosystems sequencing analysis software calculates the base-spacing by averaging the peak-to-peak distance in the raw electropherogram between scans 1000 and 2000 relative to the location of the primer peak (2). Thus base-calling from fragmented gel files can be problematic if the analysis software attempts to use this criteria to evaluate base-spacing when the primer peak is not where the analysis software assumes it is. After application of a short C program, called GelWeld, one large gel file was reconstructed from the two gel-file fragments. Figure 2 shows the electropherogram extracted from the reconstructed gel file. Note that the three base-calling errors present in Figure 1B have been corrected. GelWeld works by manipulating the data storage architecture of the Macintosh computer (Apple Computer, Cupertino, CA, USA) in general and of PE Applied Biosystems gel files in particular, both of which have already been discussed elsewhere (1). Specifically, it copies the information resident in the data forks of both the fragment files over the data fork of a third “template” file. The original contents of the template file’s data fork are deleted in the process, so it is important to keep a backup copy of the template file before running GelWeld. GelWeld is available for download at: http://gestec. swmed.edu/gestec2.htm along with a complete set of instructions. A copy of the source code is also provided should
منابع مشابه
Rescuing corrupted gel files from Model 377 and 373 DNA Sequencers.
Automated DNA sequencing requires the intensive use of computers to handle the large amount of data taken. When a computer failure occurs and the data are no longer accessible, all the expense and effort that went into the sequencing experiment is lost. By using the data storage architecture of Macintosh computers to our advantage, we may prevent this loss in the case of automatic sequencers fr...
متن کاملImproving read lengths by recomputing the matrices of Model 377 DNA Sequencers.
To our knowledge, the most widely used automated DNA sequencers in the world are the Model 377 and Model 373 DNA Sequencers manufactured by PE Applied Biosystems (Foster City, CA, USA). These sequencers detect the emission spectra of the four fluorescent dyes in sequencing reactions that are excited by an argon ion laser. Each of the four possible bases are labeled by a different dye, possessin...
متن کاملEvaluation and validation of the ABI 3700, ABI 3100, and the MegaBACE 1000 capillary array electrophoresis instruments for use with short tandem repeat microsatellite typing in a forensic environment.
The demand for high-throughput DNA profiling has increased with the introduction of national DNA databases and has led to the development of automated methods of short tandem repeat (STR) profile production; however, a potential bottleneck still exists at the gel electrophoresis stage. Capillary electrophoresis sequencers capable of processing 96 samples with considerably reduced manual interve...
متن کاملIncreasing sample throughput on a standard 36-lane model 377 DNA sequencer to 48 or 64 samples per run.
During the last decade, automated DNA sequencing systems have started to gradually replace conventional methods of DNA sequencing and fragment analysis. Automated systems offer greater speed and ease of use, as they do not require a separate fragment detection step following electrophoresis. In addition, automated DNA sequencing and fragment analysis systems also offer substantial quality impro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- BioTechniques
دوره 24 6 شماره
صفحات -
تاریخ انتشار 1998